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(Dated: March 18, 2008) 

We measure the it production cross section in pp collisions at y/s — 1.96 TeV in the lepton+jets 
channel. Two complementary methods discriminate between signal and background, 6-tagging and 
a kinematic likelihood discriminant. Based on 0.9 fb _1 of data collected by the DO detector at the 
Fermilab Tevatron Collider, we measure a t j = 7.62 ± 0.85 pb, assuming the current world average 
m t — 172.6 GeV. We compare our cross section measurement with theory predictions to determine 
a value for the top quark mass of 170 ± 7 GeV. 
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PACS numbers: 13.85.Lg, 13.85.Qk, 14.65. Ha 



The standard model fixes all properties of the top 
quark except its mass. The cross section for top quark 
production depends on the couplings of the top quark and 
on its mass. In this Letter, we report the most precise 
measurement of the top-antitop quark pair (it) produc- 
tion cross section to date. By comparing the measured 
cross section to predictions we test whether the top quark 
conforms with standard model expectations. We also for 
the first time extract a constraint on the top quark mass 
based only on this comparison. This determination of the 
top quark mass is complementary to direct measurements 
and has the advantage that it is done in a well-defined 
renormalization scheme, that employed in the calculation 
of the cross section. 

The Tevatron collides protons and antiprotons at yfs = 
1.96 TeV. Most top quarks at the Tevatron are created 
in pairs through the strong interaction, although evi- 
dence of single top quark production has been reported 
recently [lj]. For a top quark mass of 175 GeV, the stan- 
dard modelpredicts a it production cross section of about 
6.7 pb [1, Previous measurements 0, [B| agree with 
this prediction within their precision of 15%. Here we 
present an substantially improved measurement of the it 
production cross section, based on data collected by the 
DO detector Q between August 2002 and December 2005 
with an integrated luminosity of 0.9 fb -1 . 

In the standard model the top quark always decays 
to a W boson and a b quark. The decay modes of the 
W boson define the possible final states. Here we focus 
on the lepton+jets channel in which one W boson de- 
cays to ev, /iv, or tv followed by r -> evv or fivv. We 
refer to such leptons as prompt. The other W boson 
decays to jets or to tv followed by a hadronic r decay. 
The branching fraction for this channel is 38%. The DO 
detector acquires these events by triggering on an elec- 
tron or muon and at least one jet with large momentum 
component transverse to the beam direction (pr). The 
event selection Q requires exactly one electron or muon, 
that is isolated from other objects in the detector, with 
p T > 20 GeV and \rj\ < 1.1 (for e) or |r?| < 2 (for fj), 
missing transverse momentum j>T > 20 GeV (for e+jets) 
or 25 GeV (for /x+jets), and at least three jets with px > 
20 GeV and |7y| < 2.5. The pseudorapidity is defined 
as i) = — ln[tan(0/2)] and 9 is the polar angle with the 
proton beam. The leading jet must have pr > 40 GeV 
and the lepton pr and $t vectors must be separated in 
azimuth to reject background events with mismeasured 
particles. Jets are reconstructed using the Run II cone 
algorithm 8] with cone size yj (A<p) 2 + (Ay) 2 = 0.5, in 
terms of azimuth <f> and rapidity y. We call this the in- 
clusive lepton+jets sample. Table U gives the number of 
selected events (iVdata)- The expected tt signal accounts 
only for about 20% of this sample. Most events origi- 
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TABLE I: Event counts in the inclusive lepton+jets sample. 



nate from other processes that produce prompt leptons 
and jets (mostly W+jets production) and from events 
with jets which mimic the signature of a lepton. We use 
two complementary techniques to distinguish the tt sig- 
nal from these backgrounds; 6-tagging and a kinematic 
likelihood discriminant. 

We model the tt signal and all backgrounds with 
prompt leptons using Monte Carlo (MC) simulations. 
We carry out the analyses using tt events generated 
at a reference mass of 175 GeV. jy+jets and Z+jets 
production are generated using the ALPGEN [3] gener- 
ator and PYTHIA Oil for showering. A matching algo- 

S ingle 



rithm [ll| avoids double counting of final states, 
top production is generated using SINGLETOP [12j and 
COMPHEP [13[ . Diboson and it production are generated 
by pythia. All simulated events are processed by a de- 
tector simulation based on GEANT [14J and by the same 
reconstruction programs as the collider data. 

We first determine the background from events with- 
out prompt leptons in the inclusive lepton+jets sample 
using loose data samples defined by relaxing the electron 
identification and the muon isolation requirements. We 
use simulated events to determine the probability e s for 
leptons from W boson decays that satisfy the loose selec- 
tion to also pass the selection used for the measurement. 
We correct this efficiency for known differences between 
efficiencies observed in the MC simulation and in data. 
We determine the corresponding efficiency e& for misiden- 
tified leptons using data selected with the criteria given 
above except for requiring $t < 10 GeV to minimize con- 
tributions from leptons from W boson decays. The num- 
ber of events in our selected sample is A/data = N% + N# , 
where is the number of events with prompt leptons 
and JVjj the number of events without prompt leptons. 
The number of events in the corresponding loose sam- 
ple is N\ oosc — Ngj/e s + iVjj/e;,. These two equations 
determine JVjj given in Table [fl We predict the num- 
ber of events, Aether, from the smaller background pro- 
cesses (single top, Z+jets, and diboson production) using 
the MC simulation and next-to-leading order cross sec- 
tions 11511. 
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FIG. 1: Jet multiplicity spectra for e+jets and /i+jets events 
(a) with one 6-tagged jet and (b) with at least 2 6-tagged jets. 
The histogram shows (from top to bottom) the contributions 
from it production, and from backgrounds with prompt lep- 
tons and without prompt leptons. 
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TABLE II: Numbers of events in the 6-tagged analysis. 



For the 6-tag analysis, we start with the expected ti 
cross section to get a first estimate of the number of ti 
events in the sample. After we obtain a cross section 
as described below we update this estimate using the 
measured cross section and iterate the cross section cal- 
culation until the result is stable. We fix the number of 
W^+jets events in the inclusive sample so that the sum 
of all background and signal contributions equals the ob- 
served number of events. 

The 6-tag analysis enhances signal purity by requiring 
that at least one jet be tagged as a 6-jet, i.e., identified 
to contain the decay of a long-lived particle such as a 
6-hadron (l6j . We determine the number of background 
events without prompt leptons as above and the number 
of events expected from other background sources from 
the number of background events in the inclusive sample 
times their probability to be 6-tagged. We obtain the 
6-tagging probability from the MC simulation corrected 
for differences in the 6-tagging efficiencies observed in the 
simulation and in data. In order for the MC model to 
correctly predict the number of lepton+jets events with 
two jets with at least one 6-tagged jet, we have to scale 
the number of VF+jets events with heavy quarks (6, c) by 
a factor of 1.17±0.18 relative to the rest of the PF+jets 
events. We use the same scale factor for VF+^S jet 
events. Figure Q] shows the jet multiplicity spectrum of 
events with 6-tags compared to expectations. The com- 
position of the 6-tagged samples is given in Table [TTJ The 
ti contribution in Fig. [T] and Tables HI and HT1 is based on 
the cross section measured in the 6-tag analysis. 

We calculate the cross section using a maximum like- 



lihood fit [l7| to the number of events in eight differ- 
ent channels defined by lepton flavor (e, /i), jet multi- 
plicity (3, > 4), and 6-tag multiplicity (1, > 2). The 
likelihood is defined as £ = JJ i V(N i , fii(a t j)), where i 
runs over the eight channels and P(N,/j.) is the Pois- 
son probability to observe N events when fj, are ex- 
pected. The expected number of events is the sum 
of the number of events from all backgrounds plus the 
number of ti events as a function of o~ t j. We obtain 
<j t - t = 8.05 ± 0.54(stat) ± 0.70(syst) ± 0.49(lumi) pb for 
m t = 175 GeV. The third uncertainty arises from the 
measurement of the integrated luminosity [l8j |. 

Table [TTT1 lists the systematic uncertainties which arise 
from the following main categories. Selection covers ac- 
ceptance and efficiency for leptons and jets. Jet en- 
ergy calibration accounts for jet energy scale and res- 
olution. The 6-tagging efficiencies for 6, c, and light 
quark/gluon jets make up the b-tagging uncertainty. MC 
model uncertainties originate from the cross sections used 
to normalize the simulated backgrounds, differences ob- 
served between it samples generated with ALPGEN and 
pythia, the factorization and renormalization scale in 
the W+jets simulation, and the parton distributions 
functions (PDF). TVjj covers the determination of the 
number of events without prompt leptons. 

source 6-tag likelihood combined 



selection efficiency 0.26 pb 

jet energy calibration 0.30 pb 

6-tagging 0.48 pb 

MC model 0.29 pb 



likelihood fit 



0.06 pb 



0.25 pb 
0.11 pb 

0.11 pb 
0.10 pb 
0.15 pb 



0.25 pb 
0.20 pb 
0.24 pb 
0.19 pb 
0.07 pb 
0.08 pb 



TABLE III: Breakdown of systematic uncertainties. 

The likelihood analysis is based on kinematic differ- 
ences between events with ti decays and backgrounds. No 
single kinematic quantity can separate signal and back- 
ground effectively. We therefore build a likelihood dis- 
criminant from 5-6 variables, listed in Table [TVl in each 
channel. The variables were selected to be well modelled 



variable channel 

J2i=i Pr(i) +Pr(e) +#>t 
AR between lepton and jet 1 
AR between jets 1 and 2 
Acj> between lepton and $t 
A(j) between jet 1 and j/>t 
sphericity 
aplanarity 



all 

e+3 jets, e+>4 jets 
e+3 jets, e+>4 jets 
all 

e+>4 jets, /i+>4 jets 
/x+3 jets, /^+>4 jets 
e+3 jets, fi+3 jets 
all but /j+3 jets 
all but /j+3 jets 



TABLE IV: Variables used for the likelihood discriminant. 
AR = a/ (A(f>) 2 + (At/) 2 and i indexes the list of Nj jets with 
Pt > 15 GeV, ordered in decreasing pr- 



FIG. 2: Likelihood discriminant spectra for e+jets and /i+jets 
events (a) with 3 jets and (b) with at least 4 jets. See also 
the caption of Fig. [T] 





3 jets 


>4 jets 




1760 


626 


N t -t 


245±20 


233±19 




1294±48 


321±30 




227±28 


70±12 



TABLE V: Sample composition from the likelihood fit. 



by the MC simulation and to have good discrimination 
power. For this analysis, we use the inclusive lepton+jets 
sample with the additional requirement that events with 
three jets must satisfy Pr(i) > 120 GeV. The events 
are divided into four channels defined by lepton flavor and 
jet multiplicity (3, >4). 

We determine the probability density functions of the 
likelihood discriminant for signal and prompt lepton 
backgrounds from the simulation and for events with- 
out prompt leptons from a control data sample. We 
perform a maximum likelihood fit to the likelihood dis- 
criminant spectra from data in all four channels simul- 
taneously with the it production cross section as a free 
parameter. The number of events without prompt lep- 
tons is constrained to the value obtained from the loose 
data sample in the same way as described above. Table fVl 
gives the sample composition for the best fit and Figured 
shows the corresponding likelihood discriminant distribu- 
tions. We measure <r tI = 6.62 ± 0.78(stat) ± 0.36(syst) ± 
0.40(lumi) pb for m t — 175 GeV. The systematic uncer- 
tainties are listed in Table IIIII in the same categories as 
for the 6-tag analysis plus Likelihood fit which gives the 
uncertainty from statistical fluctuations in the likelihood 
discriminant shapes from the MC simulation. 

We combine the two analyses using the BLUE 
method Their statistical correlation factor is 0.31, 
determined by MC generated pseudodata sets that model 
the statistical correlation between the two analyses. The 
systematic uncertainties from each source are completely 
correlated between both analyses. The combined result 
is a tJ = 7.42 ± 0.53(stat) ± 0.46(syst) ± 0.45(lumi) pb 
for m t — 175 GeV with % 2 = 2 for one degree of free- 
dom, corresponding to a p- value of 0.16. We use sam- 
ples of it events simulated with different values of the 
top quark mass to determine the cross section as a func- 




150 155 160 165 170 175 180 185 
top quark mass (GeV) 



FIG. 3: Comparison of measured cross section and theory 
prediction versus top quark mass. 



tion of top quark mass. A polynomial fit gives cr^/pb = 
7.42-7.9xl0- 2 Am+9.7xl0- 4 (Am) 2 -1.7xl0- 5 (Am) 3 , 
where Am = m t /GeV — 175, as shown in Figure [3] 

We define likelihoods as a function of a t j and m t for the 
theory prediction and our measurement. There are two 
sources of uncertainty in the calculated cross sections, 
a theory uncertainty that arises from the termination of 
the perturbative calculation and the uncertainty from the 
PDFs. For each value of mt, we represent the former by 
a likelihood function that is constant within the ranges 
given in Refs. 0, HJ and zero elsewhere and the latter by a 
Gaussian likelihood function with rms equal to the uncer- 
tainty determined in Ref. [2] for the CTEQ6M [H error 
PDF sets. We then convolute the two functions and av- 
erage the likelihood functions from the two calculations. 
The cross section measurement is represented by a Gaus- 
sian likelihood function centered on the measured value 
with rms equal to the total experimental uncertainty. We 
multiply the theory and measurement likelihoods to ob- 
tain a joint likelihood. The contour in Figure [3] shows the 
smallest region of the joint likelihood that contains 68% 
of its integral. We integrate over the cross section to get 
a likelihood function that depends only on the top quark 
mass. We find that at 68% C.L. m t = 170 ± 7 GeV, in 
agreement with the current world average of direct mea- 
surements of the top quark mass of 172.6 ± 1.4 GeV 21|. 

In conclusion, we find that it production in pp col- 
lisions agrees with standard model predictions. At the 
world average of direct top quark mass measurements of 
172.6 GeV we measure a t j — 7.62 ± 0.85 pb. This is the 
most precise measurement of the it production cross sec- 
tion. By comparing the cross section measurement with 
the theory prediction we determine the top quark mass 
to be 170 ± 7 GeV. 
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